An overview of the prospects of top quark physics at the LHC is presented. The ATLAS and the CMS detectors are
about to produce a large amount of data with high top quark contents from the LHC proton-proton collisions. A 
wide variety of physics analyses is planned in both experiments, and a number of useful insights have already been 
obtained regarding their detector performance and physics potential. This summary is based on the talk presented at 
the Hadron Collider Physics Symposium 2008, Galena, Illinois, May 27-31, 2008.


1. Introduction 


1.1. Top Physics timeline at the LHC 


Even though its existence was predicted a few decades ago, and its discovery was made more than a decade ago,
the top quark still plays a major role at the forefront of the experimental and theoretical challenges of high energy
physics. At the LHC, the importance of the top quark can be recognized from a number of perspectives over the lifetime
of the accelerator and of the general purpose detectors on the ring, ATLAS and CMS, which are planned to commence
operation later this year. In the past years, the two collaborations studied various aspects of top quark physics
using Monte Carlo event generators and detector simulation, not only testing the existing analysis methods developed
at the Tevatron experiments, but also suggesting a number of new ideas that have been shown to be relevant and feasible at
the LHC. This paper provides a brief review of these analyses and shows how they fit together in each phase of
data-taking operation. The results were collected mainly from the Computing System Commissioning analysis in the
ATLAS experiment [1, 2] and the Technical Design Report from CMS [3, 4], with updates where available.


1.2. Top physics: topics of interest and constraints at the LHC


There is a wide variety of top physics topics accessible with the LHC data that are of interest to experimentalists
and theorists. Some of them are illustrated in Figure 1. The production mechanism of the top quark at the highest
energy regime within our reach is of primary interest. The measurement of the total production cross-section
will by itself enable us to distinguish between a number of theoretical predictions. Using additional variables and studying
differential cross-sections, we can further study the nature of the production vertex to search for resonant
structures or anomalous couplings. The properties of the top quark itself need to be measured as well. These include
mass, width, spin, charge and couplings to other particles, including the Yukawa coupling to the Higgs boson. The
top quark decays almost exclusively to a W boson and a b quark, though with a sufficient amount of data we can probe
the precise nature of the Wtb vertex and the possible existence of rare decays. On the other hand, extracting such
measurements is a challenging endeavor: events produced at hadron colliders are known to have a high multiplicity.
Understanding the beam luminosity can be a tricky task. At high luminosity, the estimation of pile-up will become a
primary concern. The contribution of non-perturbative QCD processes to the observed events also has large uncertainties.
The parton content of the beam (PDF), the amount of initial state radiation (ISR) and the accompanying
underlying event have to be taken into account and understood before the experiments can carry out precision
measurements.

Several experimental uncertainties affect the reconstruction of the events: a sound measurement of the jet energies
can only be achieved after extensive calibration efforts. The missing transverse energy ($E_T^{\mathrm{miss}}$) measurement requires
a global understanding of the detector. Care needs to be taken to avoid possible bias from trigger requirements on
the measured quantities. In addition, the understanding of the b-tagging performance is a complex task in itself and
can be affected by theoretical issues such as b-fragmentation.

Figure 1: Diagrams illustrating the theoretical interests (left) and the experimental issues (right) related to top quark physics.
The arrows indicate connections between the topics and the objects within $t\bar{t}$ events.

During the lifetime of the experiment the analysis methods will evolve: early data analysis will use only the most reliable
information. At this point known physics processes can be used to calibrate and better understand the detector
in order to be able to perform more complex analyses. This applies to the case of top quark physics: its observation
will be challenging since it typically requires jets and $E_T^{\mathrm{miss}}$; however, once observed, the $t\bar{t}$ event topology has a
number of applications that will help us understand the detector.



1.3. Tevatron and the LHC 

The LHC is literally a top factory. Its large center-of-mass energy, $\sqrt{s} = 14$ TeV, will yield a $t\bar{t}$ production
cross-section of 833 pb, larger than at the Tevatron, $\sqrt{s} = 1.96$ TeV, by a factor of ~100. The design luminosity of
the LHC is also a factor of 100 larger than the Tevatron luminosity, so statistics will quickly become a non-issue for
many top quark measurements. At its design luminosity, one $t\bar{t}$ pair is produced almost every second at the LHC, as
opposed to 10 pairs per day at the Tevatron.
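
As a rough consistency check of these figures (a back-of-the-envelope sketch that uses only the two factors of ~100 quoted above and the 86,400 seconds in a day), the ratio of the production rates $R$ can be compared with the ratio of the quoted pair rates:

```latex
\frac{R_{\mathrm{LHC}}}{R_{\mathrm{Tevatron}}}
  = \frac{\sigma_{\mathrm{LHC}}}{\sigma_{\mathrm{Tevatron}}}
    \times \frac{\mathcal{L}_{\mathrm{LHC}}}{\mathcal{L}_{\mathrm{Tevatron}}}
  \approx 100 \times 100 = 10^{4},
\qquad
\frac{1~\mathrm{s}^{-1}}{10~\mathrm{day}^{-1}} = \frac{86400}{10} \approx 0.9 \times 10^{4}.
```

The two estimates agree at the order-of-magnitude level, which is all that the rounded inputs allow.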

In addition to the production rate, the $t\bar{t}$ production mechanism differs significantly. The proton-antiproton
collisions at the Tevatron produce $t\bar{t}$ pairs predominantly (85% of the cases) through quark-antiquark annihilation,
while the majority of the $t\bar{t}$ production at the LHC (90% of the cases) is due to gluon-gluon fusion. At the LHC, the
PDF uncertainty for the gluon-gluon fusion process is much smaller than at the Tevatron, while $q\bar{q}$ processes entail a larger
uncertainty. The total uncertainty on the $t\bar{t}$ cross-section due to the PDF is estimated to be smaller at the LHC [6].



1.4. ATLAS and CMS 

The ATLAS and the CMS detectors were built primarily to discover the Higgs boson. Fortunately, the energy scale for
which their performance was optimized is also well suited to top quark observation. The design of the two detectors
is similar in that they have a cylindrical structure with inner tracking detectors, calorimeters, and muon systems,
with superconducting magnets in between. Many sub-detectors are currently being commissioned and calibrated
with cosmic ray muons.

Figure 2: Some of the main $t\bar{t}$ production diagrams. Gluon-gluon fusion diagrams (left two) are the main production mode at
the LHC while the quark-antiquark annihilation (right) is dominant at the Tevatron.

In addition to detector commissioning, the commissioning of the computing facility is an ongoing effort in both 
collaborations. 

1.5. The LHC expectations in the first year 

According to the current schedule, the LHC accelerator will start operation in 2009 with a center-of-mass energy lower
than the design value (for example 5 TeV + 5 TeV) to avoid the time-consuming commissioning required to achieve
a beam energy above 5.5 TeV. The initial luminosity will be lower than the design luminosity by a factor of 1000. The lower
beam energy means that the $t\bar{t}$ cross-section will be reduced by half. This applies roughly to all background processes
as well. Even with 3 months of uninterrupted data-taking, the integrated luminosity will be less than 100 pb$^{-1}$. The
number of $t\bar{t}$ events will therefore be less than 40,000 in the first year of operation. The changes in the initial beam
parameters were determined only shortly before this review was compiled. All studies presented here assume the
design beam energy.
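
The event-count estimate above is simply the cross-section multiplied by the integrated luminosity. The following minimal sketch reproduces that arithmetic; the function name `expected_events` is ours, and taking exactly half of the 833 pb design-energy cross-section and exactly 100 pb$^{-1}$ are simplifying assumptions, since the text only gives approximate bounds.

```python
def expected_events(cross_section_pb: float, integrated_luminosity_inv_pb: float) -> float:
    """Expected number of produced events, N = sigma * integrated luminosity."""
    return cross_section_pb * integrated_luminosity_inv_pb


# Design-energy cross-section quoted in the text, halved for the lower beam energy.
design_xsec_pb = 833.0
early_xsec_pb = design_xsec_pb / 2.0

# Upper bound on the first-year data set quoted in the text (100 pb^-1).
upper_bound = expected_events(early_xsec_pb, 100.0)
print(f"ttbar pairs produced in 100 pb^-1: about {upper_bound:.0f}")
```

With these rounded inputs the result is of order 4 x 10^4 pairs, in line with the bound quoted above. It counts produced pairs only, before any trigger or selection efficiency.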

2. Early data analysis - Physics commissioning and first measurements 

As explained earlier, at the beginning of data taking the detector performance will need to be understood,
and therefore the top quark observation will not be the first measurement to be done at the LHC. However, the
first observation of the top quark will be an important milestone, indicating that the physics analysis environment is
well established. Thus, a number of analyses are in preparation aiming for top observation with the initial data,
followed by cross-section measurements.

2.1. Dilepton channel 




Figure 3: Jet multiplicity for jets with $p_T > 30$ GeV, normalized to 10 pb$^{-1}$.



The dileptonic final state is a rare signature in which both W bosons produced in the decay of the top pair
decay leptonically (to an electron or a muon); it can be triggered with very high efficiency. Although the cross-section for
dileptonic channels is small, this is possibly the first place where evidence of top events can be seen. The fake lepton
contribution from QCD jets will be small thanks to the requirement of two leptons in the final
state. Therefore the main background comes from Drell-Yan (DY) production and diboson processes. In events with
an electron and a muon in the final state, even DY events will not contribute to the background. Figure 3 shows the
jet multiplicity in dilepton final states assuming 10 pb$^{-1}$ of data, as obtained in a CMS analysis [7]. $p_T$ thresholds
of 16 GeV and 17 GeV were used to trigger single electrons and muons respectively. The offline lepton selection
required two oppositely charged leptons with isolation criteria based on the calorimeter and the tracking. Further event
selection removed events with low missing $E_T$, and Z events were removed by applying a window cut
on the dilepton mass. The $t\bar{t}$ signal can be seen clearly in the figure. The signal purity in $e\mu$ events is outstanding.
It was estimated that, by counting the number of events with two or more jets, the cross-section can be measured
with a 13% statistical uncertainty and a systematic uncertainty of the same order using 10 pb$^{-1}$ of data.
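
The counting measurement mentioned above can be illustrated with a minimal sketch. It assumes the usual counting formula, sigma = (N_obs - N_bkg) / (efficiency x luminosity), with a purely Poisson statistical error. The function name and all input numbers are hypothetical and are not taken from the CMS analysis.

```python
import math


def counting_cross_section(n_observed, n_background, efficiency, luminosity_inv_pb):
    """Return (sigma, stat_error) in pb for a simple counting experiment.

    sigma = (N_obs - N_bkg) / (efficiency * luminosity); the statistical error
    takes only the Poisson fluctuation sqrt(N_obs) of the observed count.
    """
    if efficiency <= 0 or luminosity_inv_pb <= 0:
        raise ValueError("efficiency and luminosity must be positive")
    denominator = efficiency * luminosity_inv_pb
    sigma = (n_observed - n_background) / denominator
    stat_error = math.sqrt(n_observed) / denominator
    return sigma, stat_error


# Hypothetical inputs chosen only to illustrate the arithmetic.
sigma, stat = counting_cross_section(
    n_observed=70, n_background=10, efficiency=0.0072, luminosity_inv_pb=10.0
)
print(f"sigma = {sigma:.0f} +/- {stat:.0f} (stat.) pb")
```

Here `efficiency` stands for the product of branching fraction, acceptance and selection efficiency. Systematic uncertainties on the background estimate and on the efficiency would have to be propagated separately.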



2.2. Semileptonic channel 

Measurements in the semileptonic (or "lepton plus jets") channels benefit from a large cross-section (30% of the $t\bar{t}$
events decay into $e/\mu$+jets), and in this channel the top quark can be fully reconstructed from its hadronic decay
products. The combinatorial ambiguity can be large, as shown in Figure 4 (left), though a clear top mass peak is visible.
B-tagging can reduce this combinatorial background significantly but is not used in the early-data analysis by ATLAS
[2]. Instead, top quarks are reconstructed by taking the highest $p_T$ trijet combination, assuming that the jets from the
top decay tend to be collimated in a similar direction. To increase purity, it is required that there is at least one dijet
combination with an invariant mass close to the mass of the W boson. With the requirement of $E_T^{\mathrm{miss}} > 20$ GeV and an
isolated electron or muon with $p_T > 20$ GeV, the selection efficiency is 18.2% and 23.6% respectively. The efficiency is roughly
halved by the W mass requirement. The background can be estimated by fitting a Chebyshev polynomial, while
the signal yield is obtained from a Gaussian fit. A precision of $\Delta\sigma/\sigma$ = 7 (stat.) ± 15 (syst.) ± 3 (PDF) ± 5 (lumi) %
was estimated at 100 pb$^{-1}$. Although this data-driven method is robust against theoretical uncertainties, it is
sensitive to the stability of the fit. An alternative counting method that uses a Monte Carlo W + jets sample has also been
suggested. This method is sensitive to the uncertainty on the rate of additional jets, but a competitive precision of
$\Delta\sigma/\sigma$ = 3 (stat.) ± 16 (syst.) ± 3 (PDF) ± 5 (lumi) % has been obtained, and it can serve as a cross-check. A possible
improvement to this analysis, currently under study, is to use a data-driven method to estimate the W + jets background.
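
The trijet reconstruction described above can be sketched as follows. This is an illustration only, not the ATLAS implementation: the helper names, the 10 GeV half-width of the W mass window and the W mass value of 80.4 GeV are assumptions introduced here, and jets are represented as plain (E, px, py, pz) tuples.

```python
import math
from itertools import combinations

W_MASS_GEV = 80.4  # approximate W boson mass (assumed reference value)


def four_vector(pt, eta, phi, mass=0.0):
    """Return (E, px, py, pz) from collider coordinates."""
    px, py, pz = pt * math.cos(phi), pt * math.sin(phi), pt * math.sinh(eta)
    energy = math.sqrt(px * px + py * py + pz * pz + mass * mass)
    return (energy, px, py, pz)


def add(vectors):
    """Component-wise sum of four-vectors."""
    return tuple(sum(component) for component in zip(*vectors))


def invariant_mass(v):
    e, px, py, pz = v
    return math.sqrt(max(e * e - px * px - py * py - pz * pz, 0.0))


def transverse_momentum(v):
    return math.hypot(v[1], v[2])


def hadronic_top_candidate(jets, w_window_gev=10.0):
    """Pick the three-jet combination with the highest vector-sum pT.

    Returns (trijet_mass, has_w_candidate), or None with fewer than three jets.
    has_w_candidate is True if any jet pair inside the chosen trio has an
    invariant mass within w_window_gev of the W mass.
    """
    if len(jets) < 3:
        return None
    best = max(combinations(jets, 3), key=lambda trio: transverse_momentum(add(trio)))
    dijet_masses = [invariant_mass(add(pair)) for pair in combinations(best, 2)]
    has_w = any(abs(m - W_MASS_GEV) < w_window_gev for m in dijet_masses)
    return invariant_mass(add(best)), has_w


# Hypothetical jets given as (pT [GeV], eta, phi), for illustration only.
example_jets = [four_vector(*j) for j in [(90, 0.2, 0.1), (60, -0.3, 0.9), (45, 0.5, -0.6), (30, 1.5, 2.8)]]
print(hadronic_top_candidate(example_jets))
```

Because no b-tagging information enters, the chosen trio is sometimes the wrong one, which is the combinatorial background visible under the peak in Figure 4 (left).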
Figure 4: Left: mass of the highest $p_T$ trijet combination before applying the W mass window cut. Right: $E_T^{\mathrm{miss}}$ spectrum
compared with a SUSY signal and other potential backgrounds to SUSY.



3. Top as background to new physics 



The measurement of the $t\bar{t}$ cross-section is important since the NLO theoretical cross-section is highly sensitive to
the QCD scale variation, with an uncertainty estimated to be ~12% [8]. There are ongoing efforts to complete a full NNLO calculation in the
near future, presumably reducing the uncertainty by half. While the LHC experiments are challenged to test the theory
at this precision, the measurement is also an important input to new physics searches, many of which look for a final state
topology very similar to that of top events. These include some Higgs searches (e.g. $H \to t\bar{t}$, $WH$, $t\bar{t}H$),
supersymmetry searches and the twin Higgs model (e.g. $W_H \to tb$), among others.

Many supersymmetry searches look for a large $E_T^{\mathrm{miss}}$ accompanied by lepton(s) and jets. Signal events typically
have much more missing energy than $t\bar{t}$ events, though the estimation of the background coming from the tail of the
distribution is crucial to identify the signal with confidence [1]. This is illustrated in Figure 4 (right). The $E_T^{\mathrm{miss}}$
tail cannot be estimated reliably from detector simulation, and a number of data-driven methods are being tested.
Such methods require a well-understood control region rich in background but sparse in signal. For this reason, one
proposed method relies on well-reconstructed top events to estimate the contamination in the signal region [2].

Obtaining a suitable control region can be problematic depending on the signal model sought. The signatures in
some regions of SUSY parameter space, such as SU4, can overlap largely with $t\bar{t}$. In this scenario, the event yield from SUSY
in the above-mentioned semileptonic $t\bar{t}$ analysis can be as large as one fifth of that from $t\bar{t}$. It is therefore crucial to measure
the $t\bar{t}$ cross-section by a variety of methods to ensure that consistency is observed in the largest possible phase space.

4. Top as a candle in the dark 

Once the first observation of the top quark is established with the first few fb$^{-1}$ of data, and the consistency with the
Standard Model is confirmed, $t\bar{t}$ events can serve as a standard reference point. New methods have been developed
that are useful for dealing with detector performance issues that are otherwise difficult to understand.

4.1. Efficiency of b-tagging

Since a good number of $t\bar{t}$ events can be obtained with a reasonable purity without using b-tagging, such a sample
can be used to measure the b-tagging performance. One conventional method uses dijet events in which one of the
jets is tagged using a soft lepton tag. However, the b-tagging performance might vary in events with higher $p_T$ jets
and larger jet multiplicity. To extract the b-tagging efficiency in such an environment $t\bar{t}$ events can be used, even if the
production rate is much smaller than that of dijet events.

One simple method is to count the number of events with different b-tagged jet multiplicities [2]. To obtain the
efficiency, the multiplicity distribution estimated from Monte Carlo simulation, including the expected background events,
is fitted to the observed distribution. This method relies on detector simulation and only the integrated efficiency
can be measured. A precision of [...] is expected with 100 pb$^{-1}$ of data.
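
The idea behind the counting method can be illustrated with a deliberately idealized sketch. It assumes that every selected event contains exactly two taggable b-jets, that tags are independent, and that there are no mistags or background events. None of these simplifications is made in the analysis described above, which fits simulated multiplicity distributions including background. The function names are ours.

```python
def btag_efficiency_from_counts(n_zero_tag, n_one_tag, n_two_tag):
    """Estimate the per-jet b-tagging efficiency from tag-multiplicity counts.

    Under the idealized assumptions (two b-jets per event, independent tags,
    no mistags, no background) the number of tags per event is binomial(2, eff),
    so the maximum-likelihood estimate is (total tags) / (total b-jets).
    """
    n_events = n_zero_tag + n_one_tag + n_two_tag
    if n_events == 0:
        raise ValueError("need at least one event")
    return (n_one_tag + 2 * n_two_tag) / (2 * n_events)


def expected_fractions(eff):
    """Fractions of events with 0, 1 and 2 tags under the same assumptions."""
    return ((1 - eff) ** 2, 2 * eff * (1 - eff), eff ** 2)


# Hypothetical counts of events with 0, 1 and 2 b-tagged jets.
print(btag_efficiency_from_counts(160, 480, 360))
```

Since only the total counts per tag multiplicity enter, the result is an efficiency integrated over the jet kinematics of the sample, which matches the limitation noted above.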

Another method tries to identify a pure sample of b-jets by exploiting the $t\bar{t}$ event topology, for example through the reconstructed
top mass. However, due to the existence of combinatorial and other physics backgrounds, one can only extract the
b-tagging performance on a statistical basis, by subtracting the background distribution from the histogram. Once the
background is properly subtracted, it is possible to obtain the shape of the b-tagging discriminant variable. With it, the
b-tagging efficiency can be measured as a function of the selection criteria, as shown in Figure 5 (left). This method is
statistically less robust than the first method, though with enough data it can be extended to include other
variables such as $p_T$ and $\eta$.

4.2. Jet energy scale 

Figure 5: Left: The b-tagging discriminant variable distribution estimated using the enriched b-jet sample from $t\bar{t}$ events at 100
pb$^{-1}$ (points) and the true distribution (line). Right: reconstructed top and W mass as a result of ensemble testing at 1 fb$^{-1}$
before and after the in-situ calibration.

The jet energy calibration is crucial for almost all analyses at the LHC, including the top mass measurement.
However, the precise measurement of the jet energy scale (JES) and jet energy resolution is known to be a non-trivial
task. In $t\bar{t}$ events, one can use the non-b-tagged jets from the top decay since they are known to originate from
the light quarks from the W decay. The current method uses a number of template models with varying scale and
resolution [1] as well as a kinematic fit on the $t\bar{t}$ event topology to purify the candidate jets [9]. The jet energy scale
can be obtained by fitting the templates to the data. This calibration can be performed in-situ in semileptonic $t\bar{t}$
events and can improve the stability of the top mass measurement significantly, as shown in Figure 5 (right).
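
The principle of the in-situ calibration, constraining the light-jet scale with the known W mass, can be shown with a much simplified stand-in for the template fit. The sketch below scans a single global scale factor applied to the reconstructed dijet mass and keeps the value that minimizes a chi-square against a reference W mass. The 80.4 GeV reference, the 10 GeV resolution, the scan range and the function names are all assumptions made for illustration; the analysis described above varies both scale and resolution in full templates.

```python
W_MASS_GEV = 80.4  # approximate W mass used as the reference (assumed value)


def chi2(scale, dijet_masses, resolution_gev):
    """Chi-square of rescaled dijet masses against the reference W mass."""
    return sum(((scale * m - W_MASS_GEV) / resolution_gev) ** 2 for m in dijet_masses)


def fit_jet_energy_scale(dijet_masses, resolution_gev=10.0, lo=0.8, hi=1.2, steps=401):
    """Scan candidate scale factors and return the one with the smallest chi-square."""
    candidates = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return min(candidates, key=lambda s: chi2(s, dijet_masses, resolution_gev))


# Hypothetical reconstructed W -> jj masses (GeV) that sit a few percent low.
measured = [76.0, 78.0, 74.0, 76.0]
print(fit_jet_energy_scale(measured))
```

A scale factor above one indicates that the uncalibrated jets are reconstructed too low. The same factor would then be applied to the light jets before the top mass is reconstructed.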

There is a remaining issue with the jet energy scale when the jet originates from a b quark. In this case an
additional correction is necessary to take into account the larger fraction of energy lost outside the jet cone compared to the
case of light jets. A method has been explored that extracts the b-jet energy scale (BJES) independently of the top mass
[10], though it has not yet been proven to be viable. The current best method is to fix the top mass to the best known
value, but this is clearly unfavorable.



5. Top property measurement 

With the increasing amount of data accumulated at the LHC after the early data-taking phase, precision measurements
of the top quark properties are expected to provide new insights into the electroweak symmetry breaking mechanism.
Because of its large mass, the top quark properties may provide the first indication of the existence of new physics.



5.1. Top mass measurement 

Figure 6: Left: Reconstructed hadronic top mass distribution in the semileptonic decay mode. Right: the correlation between
the tri-lepton invariant mass and the top mass in the $J/\psi$ + lepton mode.

One of the most important variables to be measured is the top mass. The precision achieved at the Tevatron is
already nearing an impressive 1 GeV [11]. The LHC experiments will be much less limited in terms of statistics,
though the same types of systematic uncertainties, typical of hadron colliders, will exist. A number of mass
measurement methods have been studied; some are already used at the Tevatron and some are new. The most
prominent "golden" channel is the semileptonic decay mode, where the top quark can be clearly reconstructed.
Combinatorial and physics backgrounds (mainly W + jets) can be reduced with b-tagging and tight cuts on the jet
$p_T$. Instrumental background contamination is reduced by requiring large $E_T^{\mathrm{miss}}$ and an isolated high-$p_T$ lepton. A
kinematic fit can fully exploit the over-constraints that exist in semileptonic $t\bar{t}$ decays, though a simpler method has been
shown to be competitive and robust. In this method, the top candidate is reconstructed by combining the three jets
nearest to each other in angle and purified using the W mass constraint. Requiring the invariant mass of the hadronic
W and the leptonic b-jet to be greater than 200 GeV and that of the lepton and the leptonic b-jet to be smaller than
160 GeV, a sample purity of 78% can be reached within 3 sigma of the reconstructed top mass. The mass
distribution after the final selection is shown in Figure 6 (left). The corresponding efficiency is 0.82%. The mass
extracted from a Gaussian fit is 174.6 ± 0.5 (stat.) ± 0.2 (per 1% JES) ± 0.7 (per 1% BJES) ± 0.4 (ISR/FSR) GeV
for an input top mass of 175 GeV. The in-situ jet energy calibration mentioned in the previous section yields 1%
precision with 1 fb$^{-1}$ of data, but the uncertainty on the b-jet energy scale may be much larger.
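
To see how the b-jet energy scale can come to dominate, the quoted terms can be combined in quadrature for different assumed scale uncertainties. This is a sketch under the assumption that the four sources are uncorrelated, which the text does not state. The BJES values in the loop are hypothetical, and the function name is ours.

```python
import math

# Coefficients quoted in the text (GeV); the JES terms are per 1% of scale uncertainty.
STAT_GEV = 0.5
PER_PERCENT_JES_GEV = 0.2
PER_PERCENT_BJES_GEV = 0.7
ISR_FSR_GEV = 0.4


def total_mass_uncertainty(jes_percent, bjes_percent):
    """Quadrature sum of the quoted terms, assuming they are uncorrelated."""
    terms = (
        STAT_GEV,
        PER_PERCENT_JES_GEV * jes_percent,
        PER_PERCENT_BJES_GEV * bjes_percent,
        ISR_FSR_GEV,
    )
    return math.sqrt(sum(t * t for t in terms))


for bjes in (1.0, 3.0, 5.0):  # hypothetical b-jet energy scale uncertainties in percent
    print(f"JES 1%, BJES {bjes:.0f}%: {total_mass_uncertainty(1.0, bjes):.2f} GeV")
```

With a 1% light-jet scale uncertainty fixed, the total grows almost linearly with the BJES uncertainty once that uncertainty exceeds about 1%.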

The mass measurement based on jet energies is sensitive to potentially large energy scale uncertainties, as discussed
above. A method that relies entirely on lepton information has been suggested by CMS [3]. The method requires
semileptonic events in which the b quark from the top decay forms a B meson that subsequently decays into a $J/\psi$.
The branching ratio for this decay chain is of the order of $5.5 \times 10^{-4}$, so the method requires a very large amount of data. The
extraction of the mass relies on Monte Carlo simulation, from which the correlation between the peak of the invariant
mass of the three leptons and the mass of the top quark is extracted, as shown in Figure 6 (right). The statistical
uncertainty with 20 fb$^{-1}$ is ~1 GeV and the systematic uncertainty is dominated by the uncertainty of the MC
generator parameters, estimated to be 1.5 GeV using the TopRex [12] and Pythia [13] generators.
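
The extraction step can be pictured as a calibration curve that is fitted on simulated samples and then inverted for the measured peak. The sketch below assumes a linear relation and uses invented calibration points. The actual functional form and numerical values of the CMS study are not given in the text, and the function names are ours.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def top_mass_from_peak(peak_gev, slope, intercept):
    """Invert the calibration line to turn a measured peak into a top mass."""
    return (peak_gev - intercept) / slope


# Hypothetical Monte Carlo calibration points:
# generated top mass and fitted three-lepton mass peak, both in GeV.
mc_top_masses = [165.0, 170.0, 175.0, 180.0, 185.0]
mc_peaks = [62.5, 65.0, 67.5, 70.0, 72.5]

slope, intercept = fit_line(mc_top_masses, mc_peaks)
print(top_mass_from_peak(68.0, slope, intercept))
```

An uncertainty on the fitted peak translates into a top mass uncertainty enlarged by a factor 1/slope, so a shallow correlation costs statistical precision. Any mismodelling in the generator shifts the calibration line itself, which is why the generator parameters dominate the systematic uncertainty quoted above.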

5.2. $t\bar{t}$ spin correlation

One interesting property of $t\bar{t}$ production is the spin correlation between $t$ and $\bar{t}$. The double differential cross-section
reveals the correlation as follows:

$$\frac{1}{N}\frac{d^2N}{d\cos\theta_l\, d\cos\theta_q} = \frac{1}{4}\left(1 - A\,\kappa_l \kappa_q \cos\theta_l \cos\theta_q\right) \qquad (1)$$

and it is observable using the decay products (the lepton, $l$, for the leptonic top and a light jet, $q$, for the hadronic top) as spin
analyzers (with analyzing power $\kappa$) of either top in their helicity basis. The Standard Model predicts a correlation $A$
of 0.32. A good understanding of the bias in the measured angles is necessary to interpret the observation, and the
current method, which relies on detector simulation, has an estimated precision of 17% with 10 fb$^{-1}$. Spin correlation becomes
particularly relevant if a $t\bar{t}$ resonance is discovered. Theoretical predictions can be distinguished based on this
information, which reflects the spin of the resonant particle.
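
One way to see how $A$ could be extracted from the double differential distribution (a sketch that ignores detector acceptance and the angular bias mentioned above, and is not necessarily the method used in the cited study) is to take the mean of the product of the two cosines over the full range $[-1, 1]$:

```latex
\langle \cos\theta_l \cos\theta_q \rangle
  = \int_{-1}^{1}\!\!\int_{-1}^{1} x\,y\;\frac{1}{4}\left(1 - A\,\kappa_l\kappa_q\,x\,y\right) dx\,dy
  = -\frac{A\,\kappa_l\kappa_q}{4}\left(\int_{-1}^{1} x^{2}\,dx\right)^{2}
  = -\frac{A\,\kappa_l\kappa_q}{9}
\qquad\Rightarrow\qquad
A = -\frac{9\,\langle \cos\theta_l \cos\theta_q \rangle}{\kappa_l\kappa_q}.
```

The constant term integrates to zero because $x\,y$ is odd in each variable, so only the correlation term survives. For a given analyzing power, a smaller $|A|$ therefore appears as a smaller shift of the mean away from zero.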

6. Single top measurement 

At the LHC the production of a single top quark in the final state has a cross-section of about 1/3 of that of $t\bar{t}$
production. As for $t\bar{t}$, the production rate of the single top processes will also be larger by a factor of 100 or so compared
to the Tevatron. The dominant t-channel process has a cross-section of 244.6 pb, the second largest, the tW-channel,
62.1 pb and the s-channel 10.65 pb. Figure 7 shows the Feynman diagrams of each process.

Figure 7: Feynman diagrams of the leading order single top processes. From left: t-channel, s-channel and tW associated
production.

Single top production provides vital information about the top quark, which complements the knowledge
gained from the $t\bar{t}$ process. It is initiated via the weak interaction between quarks, unlike $t\bar{t}$ production, which is a manifestation
of the strong interaction. This implies that the study of the single top processes will enable us to test the Standard
Model from a different perspective, leading to a more comprehensive understanding of the particle.



6.1. t-channel single top 




Figure 8: Left: Jet multiplicity distribution of the selected t-channel sample. Right: top mass distribution after event selection
using Boosted Decision Trees. Both at 1 fb$^{-1}$.



The t-channel process has by far the largest cross-section and its production rate is roughly proportional to $|V_{tb}|^2$,
a quantity not directly measured elsewhere. The parity violating $V-A$ coupling of the weak interaction can be
tested in this channel by measuring the polarization of the top quark using its decay products. Extracting the
signal in the t-channel single top (and in fact in all others) is challenging due to the large background contribution
coming from $t\bar{t}$ and W + jets. This is illustrated in Figure 8 (left). It was shown in the Tevatron analyses that
multivariate background rejection techniques are highly effective in purifying the single top sample, and this has also
been demonstrated in ATLAS [2]. In addition to the cut-based selection using an isolated lepton, $E_T^{\mathrm{miss}}$, b-tagging and
jet multiplicity cuts, 12 additional variables were combined using the Boosted Decision Tree method. As a result, a
clear signal can be seen even with 1 fb$^{-1}$ of data, as shown in Figure 8 (right). With 10 fb$^{-1}$, a 10% precision on the
measured cross-section has been estimated including systematic uncertainties, which translates into a 5% uncertainty
on the $|V_{tb}|$ measurement.
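
The factor of two between the two uncertainties follows from the quadratic dependence of the t-channel cross-section $\sigma_t$ on $|V_{tb}|$. As a short worked step (assuming that all other inputs to the predicted cross-section are held fixed):

```latex
\sigma_t \propto |V_{tb}|^{2}
\;\Rightarrow\;
\frac{\Delta\sigma_t}{\sigma_t} = 2\,\frac{\Delta|V_{tb}|}{|V_{tb}|}
\;\Rightarrow\;
\frac{\Delta|V_{tb}|}{|V_{tb}|} = \frac{1}{2}\times 10\% = 5\%.
```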



Figure 9: Event display of $t\bar{t}$ events where the $p_T$ of the top is around 150 GeV (left) and 250 GeV (right).



6.2. Top-W associated production

Most current single top studies at the LHC rely heavily on Monte Carlo simulation. Further effort is clearly
necessary to reduce this dependence using more data-driven methods, like the one from a CMS analysis [3] shown
here. The associated top-W production has a final state extremely similar to $t\bar{t}$. To estimate the contamination from
this background, a $t\bar{t}$ enriched control region is defined by requiring an additional jet. The ratio of the number of
events in the control region to that in the signal region is calculated from MC for the signal and the $t\bar{t}$ background separately. It
is then possible to extract the number of signal events by solving the equations relating the number of observed events
in each region in terms of these ratios as follows:

$$S = \frac{R_{t\bar{t}}\,(N_s - N_s^o) - (N_c - N_c^o)}{R_{t\bar{t}} - R_{tW}} \qquad (2)$$

$$B = \frac{(N_c - N_c^o) - R_{tW}\,(N_s - N_s^o)}{R_{t\bar{t}} - R_{tW}} + N_s^o \qquad (3)$$

where $R_{t\bar{t}}$ and $R_{tW}$ are the ratios mentioned above, $N_s$ ($N_c$) is the number of observed events in the signal (control) region
and $N_s^o$ ($N_c^o$) is the non-$t\bar{t}$ background in those regions. The non-$t\bar{t}$ background is still estimated entirely from MC. With
10 fb$^{-1}$ of data, the estimated uncertainties on the measured cross-section of the semileptonic decay mode are
$\Delta\sigma/\sigma$ = ±7.5% (stat.) ± 16.8% (syst.) ± 15.2% (MC stat.).
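
The two-region extraction amounts to solving a pair of linear equations, which the following minimal sketch implements. It assumes that each ratio is defined as the control-region yield divided by the signal-region yield, the convention under which the signal and background expressions are mutually consistent. The function name and the numbers in the example are hypothetical.

```python
def solve_signal_and_background(n_s, n_c, n_s_other, n_c_other, r_ttbar, r_tw):
    """Solve the two-region system for the tW signal S and the total background B.

    n_s, n_c             : observed events in the signal and control regions
    n_s_other, n_c_other : non-ttbar background in each region (from MC)
    r_ttbar, r_tw        : control-to-signal yield ratios for ttbar and tW (from MC)
    """
    denominator = r_ttbar - r_tw
    if denominator == 0:
        raise ValueError("the two ratios must differ for the system to be solvable")
    ns_sub = n_s - n_s_other  # signal region after subtracting non-ttbar background
    nc_sub = n_c - n_c_other  # control region after subtracting non-ttbar background
    signal = (r_ttbar * ns_sub - nc_sub) / denominator
    background = (nc_sub - r_tw * ns_sub) / denominator + n_s_other
    return signal, background


# Hypothetical closure test: 100 tW and 400 ttbar events in the signal region,
# ratios of 0.5 (tW) and 2.0 (ttbar), plus 50 and 30 other background events.
n_s = 100 + 400 + 50
n_c = 0.5 * 100 + 2.0 * 400 + 30
print(solve_signal_and_background(n_s, n_c, 50, 30, r_ttbar=2.0, r_tw=0.5))
```

The closure test returns the injected signal (100) and the total signal-region background (450). The method loses sensitivity when the two ratios are close to each other, since their difference appears in the denominator.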



7. Top as a signature of new physics - $t\bar{t}$ resonance and boosted top

As well as being a background to new physics, top quarks can themselves be a signal of new physics. Alternative
(non-Higgs) models of electroweak symmetry breaking tend to involve resonances that couple strongly to the top
quark, and therefore the top is often called "the best probe for EWSB" in this respect. For example, interactions predicted
by theories such as those of [14, 15, 16]:

$$pp \to X \to t\bar{t} \quad \text{(Extra dimensions with resonance } X\text{)} \qquad (4)$$

$$pp \to b'\bar{b}' \to W^{-}t\,W^{+}\bar{t} \quad \text{(Extra generation } b'\text{)} \qquad (5)$$

$$pp \to g\,g \to g\,t\;g\,\bar{t} \quad \text{(Top color with new gauge boson } g\text{)} \qquad (6)$$

can produce top quarks in the final state.



At the simplest level, the measurement of the di-top system may reveal a resonance structure by fully reconstructing
the $t\bar{t}$ event. However, improving the resolution will take more effort. A kinematic fit can improve sensitivity in the
lower mass regions. On the other hand, if the resonance is located at a very high mass, the resulting top quarks will
be highly boosted. Under such conditions, the top decay products start to become collimated, forming a single "top-jet" in the
extreme case.

Figure 9 illustrates how this occurs. While top quarks with lower $p_T$ spread their decay products widely in the detector,
making it difficult to select the correct combination of the objects, when they have a larger boost the decay products
can be more easily assigned to each top quark. With an even higher boost, with the $p_T$ of the top above 300 GeV,
however, it starts to become impossible to separate all decay products. It then becomes necessary to look into the
substructure of these merged jet objects to distinguish them from high $p_T$ jets originating from non-resonant QCD
processes. New methods are under development to achieve this discrimination and improve the signal efficiency
in the very high energy regime. For example, the "Y-scale", which is used in $k_T$ jet algorithms to determine whether to
merge two energy clusters, can be applied to the clusters within a jet to measure the energy scale at which the jet
would split [17, 18]. A jet containing two clusters originating from a heavy resonance would have a high Y-scale, while
non-resonant QCD jets have a much lower splitting scale. Other signatures, such as displaced vertex b-tagging, are also
under investigation.
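
The splitting-scale idea can be illustrated with the standard $k_T$ distance between two clusters, $d_{ab} = \min(p_{T,a}, p_{T,b})^2\,\Delta R_{ab}^2$, whose square root has the dimension of energy. The exact definition and normalization of the Y-scale used in the cited studies are not given in the text, so the sketch below should be read as an assumption-laden example. The function names and cluster values are hypothetical.

```python
import math


def delta_r(eta1, phi1, eta2, phi2):
    """Angular separation in the (eta, phi) plane, with delta-phi wrapped to [-pi, pi)."""
    dphi = (phi1 - phi2 + math.pi) % (2 * math.pi) - math.pi
    return math.hypot(eta1 - eta2, dphi)


def kt_splitting_scale(cluster_a, cluster_b):
    """Return sqrt(d_ab), with d_ab = min(pT_a, pT_b)^2 * DeltaR_ab^2.

    Each cluster is a (pt, eta, phi) tuple; the result has the units of pt.
    """
    pt_a, eta_a, phi_a = cluster_a
    pt_b, eta_b, phi_b = cluster_b
    return min(pt_a, pt_b) * delta_r(eta_a, phi_a, eta_b, phi_b)


# Hypothetical sub-clusters of two jets, given as (pT [GeV], eta, phi).
balanced_split = kt_splitting_scale((200.0, 0.0, 0.0), (100.0, 0.0, 0.8))
soft_radiation = kt_splitting_scale((290.0, 0.0, 0.0), (10.0, 0.0, 0.8))
print(balanced_split, soft_radiation)
```

A jet whose two hardest clusters share the momentum fairly evenly, as expected for the decay of a heavy particle, gives a large scale. A jet with one soft, wide-angle emission gives a small one, which is the separation described above.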



8. Summary 

A brief overview of the prospects for top quark physics at the LHC was given. The soon-to-arrive collision
data will provide an unmissable opportunity for the field and may lead to significant new discoveries. Both ATLAS and
CMS have plans for studying the full extent of the top quark production mechanism and the top quark properties.
Simulation studies have concluded that the top quark can be observed at an early stage of the experiments and that the
proposed new analysis methods are feasible at the LHC. It is hoped, and is likely, that new insights will be gained
from top quarks throughout the entire lifespan of the experiments.